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ROUNDING OF CONTINUOUS RANDOM VARIABLES AND 
OSCILLATORY ASYMPTOTICS 

By Svante Janson 
Uppsala University 

We study the characteristic function and moments of the integer- 
valued random variable [X + a\ , where X is a continuous random 
variables. The results can be regarded as exact versions of Sheppard's 
correction. Rounded variables of this type often occur as subsequence 
limits of sequences of integer-valued random variables. This leads to 
oscillatory terms in asymptotics for these variables, something that 
has often been observed, for example in the analysis of several algo- 
rithms. We give some examples, including applications to tries, digital 
search trees and Patricia tries. 



1. Introduction. Let X be a continuous random variable with charac- 
teristic function ip{t) ='Ee^''^. We consider the random variable \_X\, that 
is, X rounded downward to the nearest integer; more generally, we consider 
[X + a\ for a € M. The purpose of this note is to give some formulas for the 
characteristic function and moments of these rounded variables. (Round- 
ing upward, or to the nearest integer, is a.s. given by replacing a by a -|- 1 
and a + 1/2, resp., so such roundings are also covered.) One motivation for 
studying such variables is that they often arise as subsequence limits in dis- 
tributions of integer-valued random variables; this is discussed in Section 4 
and illustrated by several examples from the study of tries, digital search 
trees and Patricia tries. 

Our results can be regarded as exact versions of Sheppard's correction 
[24]; see Remark 2.5. The results are inspired by some special cases studied 
in detail using another method by Hitczenko and Louchard [9] and Louchard 
and Prodinger [17]. 
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2. Results. We find it convenient to shift the variables and define, an- 
ticipating (4.3) below, 

(2.1) Xc,:=[X + a\-a + l. 

Alternatively, Xa = \X + a\— a (a.s.) and, letting {x} := x— \ x\ denote the 
fractional part, X^ = X — {X + a} + 1. Thus X^ is periodic in a: X^^n = 
for n gZ. 

Theorem 2.1. Let X be a continuous random variable. Then, with no- 
tation as above, 

oo pi* _ 1 

(2.2) y,^Jt):=Ee"^"= ^ e^""" (^(t + 2^n), 

n=— oo ^ ' 

provided this sum converges; in general, (2.2) holds with the sum interpreted 
as a Cesdro sum. (We interpret the fraction as 1 ift = —2im.) 

Proofs are given in Section 3. The sum in (2.2) can be rewritten as 

oo 

(2.3) ^xSt)= E e2-"V(i + 2vrn), 

n=— oo 

where 

e'* - 1 

m-=——^{t) 
it 

is the characteristic function oi X + U with U ^U{0, 1) independent of X. 

Remark 2.2. Suppose that the moment generating function i/;(t) := 
Ee*"^ exists in a strip a < Ret < 6, with — oo <a<0<6<oo. Suppose 
further that, for example, il^{t) = 0{\t\~^) for some 5 > on each closed 
substrip a' < Ret < 6' with a < a' < b' < b. Then ip{t) = ipiit), and (2.3) 
extends by analytic continuation to 

oo 

(2.4) Vx,(t) :=Ee*^" = ^ e2^'""V'(i + 27rni), a<Ret<&, 

n=— oo 

where ip{t) := ^-^i/;(t) is the moment generating function of X + U. 

In typical applications, ip and its derivatives decrease so rapidly that 
moments of Xa can be obtained by termwise differentiation in (2.2) or (2.3). 
We let D denote differentiation. [The Odtj"*^) condition can be weakened.] 
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Theorem 2.3. // E < oo for some m > 1 and D^(p{t) = 0{\t\^^) 
for some 6 > and < k < m — 1, then 

(2.5) EX^' = E{X + Ur + Pm{a), 

where 

f3m{a) = ^ i-"^Z)"^<^(27rn) • e^™" 

is a periodic function with mean 0. In particular: 

(i) // E |X| < oo and (p{t) = 0{\t\~^) with 6>0, then 

1 , ^ Lp{2TTn) ^2™„ 



(2.6) EXa=EX + - + J2 



2 27rni 



-e 



(ii) IfEX^ < oo and tpit), ip'{t) = 0{\t\~^) with 5 > 0, then 



27r2n2 



vrn 



and 

(2.7) VarX, = VarX + ^-f;^^^ + /32(a), 

n=l 

where ^52(0) = /32(a) — (2EX + l)/3i(a) — /3i(a)^ + /q^ /3i(a)^ da is aperiodic 
function with mean 0. 

Similar formulas for higher central moments follow too; see [17] where the 
cases m = 2 and 3 are studied in detail. For comparisons with [9] and [17], 
note that X + U — ^ has characteristic function e~^^^'^(p{t) = ^^^^^0^ip{t) and 

moment generating function (if it exists) ^^^^^!p^Tp{t). 

In many cases, (p decreases so rapidly that Pm is very small; then X^ 
thus has, for every a, approximately the same moments as X + U (for m 
small, at least); the dependence of a appears only as small oscillations. This 
is somewhat surprising since each X^ is a discrete variable while X + U 
is absolutely continuous, but it can be partly explained by the following 
observation: 



Remark 2.4. If we mix X^ by taking a random and uniform on [0, 1), 

we obtain exactly the distribution oi X + U\ in other words, Xu = X + 
U. This follows from (2.2), but also (more easily) directly from (2.1) by 
conditioning on X. Hence, if a moment of Xa depends very little on a, it 
has to be close to the corresponding moment oi X + U . 
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Remark 2.5. If we ignore the periodic terms (3m in Theorem 2.3, that 
is, if we approximate the moments of by the moments of X + U, we 
obtain the well-known Sheppard^s corrections used in statistics for moments 
of grouped data [24] (see, e.g., [6], Section 27.9). The necessity of the third 
term in (2.7) — which equals — Jq (3i{a)'^ da = — Varm(a), if m{a) = KXa 
and we regard a as random and uniform in [0, 1) — as an additional correction 
in order to obtain an unbiased estimate of the variance was pointed out by 
Carver [4]; see also [5]. 

Remark 2.6. The main term E(X + U)"^ in (2.5) is easily expressed in 
terms of the moments of X. One way is through the semi-invariants Xm, for 
which we have the simple formula 

>Cm{X + U) = XmiX) + Xm([/) = Xm(X) + Bm/m, m > 2, 

where Bm denotes the Bernoulli numbers (cf. [14]). 

Example 2.7. Let X = cY, where c > is a constant and Y has the 
(Gumbel) extreme value distribution p(y <y) = exp(— exp(— y)); thus, Y = 
— InZ with Z ~ Exp(l). We have 

(2.8) ^{t) = Ee'*"^ = E Z-'*" = r(l - ict). 

Similarly, ^(t) = r(l - ct) = -ctT{-ct), ^{t) = -c(e* - l)T{-ct) and (^(t) = 
'(/'(it). Remark 2.2 applies with a = — oo, b = \/c and yields 

Ee*^"= y e^""" ~ r(l-c(t + 27rm)), Ret<l/c. 

n=—oo 

For the mean, (2.6) yields, since El^ = 7, 

EX^ = C7 + i - ^ cr(-27rnci)e^™". 

Since \T{it)\ = [ix / {tsmh{'Kt))f/'^ ~ (27r)V2|t|-i/2g-7r|t|/2^ ^j^g ^^^^^ ^^ve 
sum are small and decrease rapidly; in the important case c = 1/ In 2, |/5i (a) | < 
1.6 • 10~^ for all a. In [17], computations for the first three moments are 
given, and several other similar examples are also treated. (This function [3i 
occurs in several related contexts too; see, e.g., [8], where uj{u) = — /3i(n), 
[10], where pi{u) = -Pi{u), [13], Answers 5.2.2-46, 6.3-19 and 6.3-28 (m = 
2), where 6o{n) = — /3i(log2n), [18], Theorem 6.2 and Exercise 6.19, where 
6{u) = — /3i(ti) and [25], page 359, where -Pi(n) = — /3i(log2 ri).) 

Example 2.8. Aldous [1] found in the random assignment problem a 
limit distribution with density h{x) = e~^{e~^ — 1 + x)/(l — e~^)^, x > 0. Let 
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X have this distribution. The mean of the fractional part {X} = X + 1 — Xq 
was used in [19]. The moment generating function of X is 

POO 

^P{t)= / y e*^-^(e-^-l + x)(A; + l)e-^'^(ia; 

^0 fc=0 

k + 1 k + 1 k + 1 



■'^\k + 2-t k + l-t {k + l-tf 

oo -j^ 

where ^{z) =r'(z)/r(z) is the logarithmic derivative of the Gamma func- 
tion; ^'{z)=J2^^o{z + ky^. It follows that \ip{t)\ = |'(/'(it)| = 0(|tj"^), and 
(2.6) yields, because ln(r(l + z)r(l - z)) = ln(7rz/sin(7r2;)) and thus ^-'(1 + 
z) + ^''(1 - z) = -z"^ + 7r2/sin2(7rz), 

E{X} = EX + l- EXo = - -Y, ^^(1 + 27rni^''(l - lirm)) 



- - y (^''(1 - 27rra) + ^''(1 + 27rni)) 



-j^ oo 



2 n=l 



1 _ V f^— 



2 ^iV47r2n2 sin2(27r2ra) 



11 ^ 7r2 



+ E 



24 ^^sinh2(27r2n)' 
where the last sum is ~ 2.8 • 10~^^. 



In exceptional cases, some oscillating terms may vanish completely. 

Example 2.9. Let X ~ U{0, N), where iV > 1 is an integer. Then (p{t) = 
(e'^* — l)/(iA^t). In particular, ip(27m) = for n ^ 0. Hence, Theorem 2.3 

yields EX^ = EX + ^, and /3i(a) = 0. For < a < 1, [X + a\ = [X\ + Y, 
with Y ~ Be(a) independent of X, and a direct calculation easily yields 

VarX„ = VarX-3^ + a(l-a), < a < 1. 

Hence (2.7) holds with /32(a) = —a^ -|-a — |,0<a<l,so/32 does not vanish 
and is not negligibly small. 

Example 2.10. Let iV > 1 be an integer and let X = Y + U = Z + 
U + where Y ~ U{0,N), U,U' ~ U{0,1) and Z = [Y\ is uniform on 
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{0,...,N - 1}, with Y,U,U' independent. Then ip{t) = -(e'^* - l)(e'* - 
l)/{Nt^), and ip{2TTn) =ip'{2'nn) =0 for n 7^ 0. Consequently, Theorem 2.3 
yields EX„ = EX + \ and VarXo = VarX + ^, with /3i(a) = /32(a) = 0, 
so there are no oscillations in the first two moments. On the other hand, 
if"{2Tm) = l/(27rn)2 / and thus (^'"(27rn) = l/{2Tmf 7^ 0, n / 0; hence, /?3 
does not vanish and there are oscillations in the third moment EX^. 

Example 2.11. In Example 4.9, we study a random variable X that 
has ip{2Tm) = ip{2Tm\) = r(l — 27rni/ln2) exactly as in Example 2.7 with 
c= l/ln2, and thus the oscillating term j3i is the same as there. 

For the variance, however, there is a surprising cancellation of the oscilla- 
tions [2, 9, 22]. Indeed, as is shown in Example 4.9, YaiXa = 1 for every a, 
and thus /?2(a) in (2.7) vanishes identically; in other words, /32(a) — (2EX + 
l)/3i(a) — /3i(a)^ is constant. (Note that (52 thus does not vanish.) The nth 
Fourier coefficient of (32 is given by Theorem 2.3 and straightforward calcu- 
lations, using (4.19), (4.20) and (4.21), as 

2 / 2TTn\ 27 / 27r?i.\ 2 ^ -.^ r(j - 27rm/ln2) 
\n^2 \ ln2V ln2 2 I ln2V ln2^tl j!(2-'' " 1) 

\n^2 V ln2 y V ln2 / 

Prodinger [22] has also given a direct proof by residue calculus of the fact 
that this rather complicated expression vanishes. 
Moreover, (2.7) further shows that 




ln2 Y nsinh(27r2n/ln2) 
^ 1.000000000001237, 

another surprise; see [12] and [22]. [VarX is given by (4.22) below.] 

Example 2.12. In Example 4.7 below, we find a random variable X 
with the moment generating function (4.8). In particular, ip{2'Kn) = i/;(27rni) = 
r(l — 27rni/ln2) is the same as in Example 2.7, and thus the oscillating term 
/3i is the same as there. Since (4.8) implies 



EX = ^/''(0) =?7'(1)/7?(1) -r'(l)/ln2 = -a + 7/ln2. 
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with a := Er(2" - 1)" ~ 1.606695, Theorem 2.3 yields 

ln2 2 ln2 V hi2 / 

(cf. the asymptotics for EC/„ in [8], [13], Answer 6.3-28, and [18], Theorem 
6.2). For higher moments, see [8] and [17], Section 5.5. 

Note further that r]{2^) = by (4.7), and thus (4.8) yields ip{khi2 + 
27rni) = for integers k>l and n / 0. Hence, (2.4) yields, using (4.9), 



(2.9) 



E 2^^^^ = i,x^ [k In 2) = ^l^{k In 2) = ^^^(^ In 2) 

k 



for every integer k>l and every a. Thus, as remarked in [17], there is no 
oscillation in the exponential moments E2'^"^". (There is oscillation for other 
exponential moments, i.e., for noninteger k and for k < 0.) 

In other words, the random variables 2-^" , < a < 1, have the same mo- 
ments; note that these variables are discrete and supported on disjoint sets. 
Their mixture 2^~^^ (see Remark 2.4) is a continuous variable with the same 
moments. We thus have a striking example of distributions not determined 
by their moments. 

Example 2.13. In Example 4.8 below, we find a related random variable 
X with the moment generating function (4.13). In particular, ip{27m) = 
■i/'(27rni) = r(l — 27rni/ln2)/(l + 27rni/ln2), and Theorem 2.3 yields 



^ 7 — 13 1 \ - „ / 27rn 

^Xo, = -k — + --a + - — Vr-i 

ln2 2 ln2 ^„ V ln2 



1 e 



27ri?ia 



(cf. the asymptotics for "KSn in [16], [13], Answer 6.3-28, and [18], Theorem 
6.4). For higher moments, see [17], Section 5.2. 

As in Example 2.12, there is no oscillation in the exponential moments 
E2'=-'^"; (2.4) and (4.13) yield, in analogy with (2.9), 



E2*=^" = V'(fcln2) = ^^1^ n(2^- - 1 



2k 



for every integer A; > 1 and every a. Again, the random var iables 2^^^, < 
a < 1, have different distributions but the same moments. 



Remark 2.14. We are studying the distribution of the integer part of 
X (possibly shifted by a constant). For comparison, note that the fractional 
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part {X} has a distribution which is a probabihty measure on T = M/Z with 
Fourier coefficients (p{27rn), numbers that appear frequently in the results 
above. In particular, we have observed that the ip{2Tm) are the same in 
Examples 2.7 (with c= l/ln2), 2.11 and 2.12, which is equivalent to the 
fact that the oscillating parts of E^a are the same; we now see that this 
is also equivalent to the fact that {X} has the same distribution in these 
examples. 

3. Proofs. 

Proof of Theorem 2.1. Fix t and consider the periodic function 
h{x) := e'*~^*'f^'J' and its Fejer sums h]\[ := K]\i * h, where is the Fejer 
kernel Kn{x) = En=-7V ^w(^)e^™'' with K^(n) = 1 - n/{N + 1). Note 
that \h]\f \ < 1 and that h^ix) h{x) as iV — > oo if x ^ Z (see, e.g., [26], 
Theorem 111.(3.4)). 

We have 

^tX^ ^ ^t{X-{X+a}+l) ^ ^ 

and thus, by dominated convergence, 

(3.1) Ee^*^" = lim E(e"^/i7v(X + a)). 

The Fourier coefficients of h are 

1 . _ . oii _ 1 



n(n) = e e ax 



e 



1(1 i{t + 27rn) 

(interpreted as 1 if t + 27rn = 0). Thus 

hN{x)= J2 KN{n)h{n)e^-'^- = ^ /^jv(n) e^""" 

n=-N n=-N + 

and 

^ pi* - 1 

E(e"^/l^(X + a))= y KN{n)-^ j_^^i(t+2nn)X+2nina 

= E ^7v(n)-^-— -e2™Xt + 27rn). 
n^N i(i + 27rn) 

The right-hand side is the A^th Cesaro mean of the sum in (2.2), and the 
result follows by (3.1) (cf., e.g., [26], Section 111.1). □ 

Proof of Theorem 2.3. Since \Xa\ < \X\ + 1, E|Xa|™ < oo; hence, 
if, ipx^ and ip are m times continuously differentiable. To see (2.5), we thus 
have to show that we can differentiate (2.3) termwise m times. 
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The assumption implies D^ip{t) = 0{\t\~^^^) for < m — 1, and thus (2.3) 
can be differentiated termwise m — 1 times: 

oo 

(3.2) L''"-Vxjt)= e2™"D™-V(t + 2vrn). 

n=— oo 

By Leibniz's rule, 

D™-V(i) = ^^i?™-V(t) + ^m(t), 
\t 

where $m involves if, . . . , D"^~'^ip and the assumption yields D^„i{t) = 
0{\t\-^-^). Hence, for < e < 1 and n / 0, 

L''"- V(27rn + e) - D"'" V(2^n) 

„i£ 1 

-vm— 1 



Z)™- V(2vrn + e) + ^>,„(27rn + e) - $m(2vrn) 



i(27rn + e) 
= 0(e|nri-'5), 
so by (3.2) and dominated convergence, 

r E XT = (0) = lim ""-Vx.W-o--Vx.(o) 

oo 
n=— oo 

The term with n = is D'^ip{0) = i'"E(X + U)"^, and (2.5) follows. 
For n^O, ^(27rn) =0, (p'{2Trn) = (^(27rn)/(27m) and 

^"(2vrn) = i^i^ - 2^^^ + 2^^, 
27rn (27rn)^ 27rn 

and the formulas for Pi and /32 used in (i) and (ii) follow; recall also that 
VarC/ = ^. 

Finally, Jq \Pi{a)\'^ da is evaluated by Parseval's relation. □ 



4. Asymptotics of integer- valued random variables. A common situation 
in the study of integer- valued random variables is the following: Yi,Y2, . . . 
is a sequence of integer- valued random variables, ai, 02, ... is a sequence of 
real numbers and X is a random variable such that 

(4.1) ¥iYn-an<Xn)=F{X <Xn)+o{l) aS n ^ OO, 

for every sequence Xn such that Xn + cin G equivalently, for every sequence 
kn of integers, 

(4.2) F{Yn < kn) = Fix + an <kn) + o{l) asn^oo. 
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Some examples are given below and in, for example, [17] and [18]; in partic- 
ular, [18], Figures 5.5 and 6.3, provide nice illustrations. 

Note that if (4.1) would hold for any real numbers x„, we would have 

Yn — an X; however, for integer- valued variables, such convergence in 
distribution is possible only if the fractional parts {a„} converge as elements 
of the circle M/Z (i.e., we identify and 1). In general, it is thus impossible 
to have convergence in distribution, and (4.1) is a natural substitute. For 
example, (4.1) should be expected for a sequence Yn of random variables 
that arise in some natural way and have bounded variances. [Of course, if 
the variance tends to oo, we can have convergence of (Yn — a^)/ Var(l^)^/^ 
to a continuous random variable; we do not consider that case in this paper.] 
Note that the centering constants a„ can be regarded as approximations of 
the median (and typically of the mean as well). 

Before proceeding, let us note that (4.1) and (4.2) can appear in differ- 
ent forms. For convenience, we state some of these versions in the following 
simple lemma; the proof is left to the reader, [dxv denotes the total vari- 
ation distance; for integer- valued random variables X and Y, dTv{X,Y) = 
ij2MX = k)-F{Y = k)\.] 

Lemma 4.1. If Yn are integer-valued random variables, an are real num- 
bers and X is a random variable, the following are equivalent: 

(i) (4.1) holds for every sequence Xn such that Xn + an 

(ii) (4.1) holds for every bounded sequence Xn such that Xn + an (z'^- 

(iii) (4.2) holds for every sequence kn of integers. 

(iv) (4.2) holds for every sequence kn of integers such that kn = an + 0{l). 

(v) sup^g2-a„ I ^(Xn - «„ < x) - P{X < x)] ^ as n^oo. 

(vi) supfcgg I P{Yn <k) - F{X + a„ < A;)] ^ as n ^ oo. 

(vii) dTy{Yn, \X + a„] ) ^ as oo. 

Although (4.1) typically prohibits convergence in distribution, it implies 
convergence in distribution of suitable subsequences. Indeed, suppose that 
X is a continuous random variable. It is easily checked that if (4.1) holds, 
then, along any subsequence such that {a„} — > a € [0, 1], 

(4.3) Yn-an^\X + a]-a = X^. 

Conversely, if (4.3) holds for every subsequence with {a^} a, then (4.1) 
holds. 

Remark 4.2. Lemma 4.1 (vii) can be written d^yiYn — an,X^an}) ~^ 
0. If, further, m > 1 and {Yn — an)"^ is uniformly integrable, this implies 
¥,{Yn — On)^ = E^{^ I +o(l), SO asymptotics for moments of Yn follow from 
Theorem 2.3. Uniform integrability is easily verified in many examples, but 
for simplicity we omit discussions of it in our examples. 
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Example 4.3. A simple example is the maximum of i.i.d. geometric 
random variables. For convenience, we shift the variables by 1; thus, let 
Yn = max(Zi, . . . , where Zi are independent and — 1 ~ Ge{p) with 

< p < 1, that is, W{Zi = k) =p{l -p)^~^, k = l,2, With q = l-p and 

a„ = logi^q n, it is easily seen that if 3;„ + a„ S Z+, then 

P(yn<X„ + an) = (l-g""+'^"r 

= (l-g^V^)" 

= exp(-g'^") +o(l), 

uniformly in such x„, and (4.1) holds with X having the extreme value dis- 
tribution P(X <x)= exp(— g^'); this is the distribution in Example 2.7 with 
c = 1/1 lng|. Hence, (4.3) also holds with this X. See further [15], Chapter 1. 

Example 4.4. Let Dn be the depth of a trie [25], Section 1.1, con- 
structed from n independent random infinite strings of the m symbols 0, . . . , 
m — 1, where the symbols in each string are chosen independently and uni- 
formly. It is easily shown (see, e.g., [20] or [18], Section 5.5) that 

¥{Dn<k) = {l-m-''Y~^, k>l. 

Dn thus have the same distribution as l^-i in Example 4.3, with q = 1/m. 
Hence, (4.1) and (4.3) hold with a„ = log„n and X having the extreme 
value distribution in Example 2.7 with c= 1/lnm. See further [18], Section 
5.5 and [17], Sections 4.1 and 5.1. 

Example 4.5. Let Hn be the height {i.e., maximum depth) of the trie 
in Example 4.4. Then (see, e.g., [7, 20], [3], Example 6.2.1, [18], Section 5.6) 
(4.1) and (4.3) hold with a„ = 21og^n — log^2 and X as in Example 4.4 
having the extreme value distribution in Example 2.7 with c= 1/lnm. 

Several other similar (but sometimes more complicated) examples from 
analyses of algorithms are studied in, for example, [17, 18, 25] and the ref- 
erences given therein. 

These references also contain related examples where it is not obvious 
whether they fit in the above framework or not. Note that if F is the distri- 
bution function of X, then (4.1) and (4.2) are equivalent to 

(4.4) HYn<kn)=F{kn-an) + o{l) asn^oo, 

for every sequence /c„ of integers. However, there are several cases where (4.4) 
has been proved for some continuous function F : R ^ [0, 1] that satisfies 
\\m.x~^~ooF{x) = and \\mx^ooF{x) = 1, but such that it is not obvious 
that F is monotone. [If it is, it is a distribution function, and thus (4.1) 
holds for X having this distribution.] 

In some cases, this problem is solved by the following lemma: 
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Lemma 4.6. Let Yn be a sequence of integer-valued random variables 
that is monotone: Yn < l^+i for n > 1. Suppose further that (4.4) holds for 
a right- continuous function F with lim^-^-oo F{x) = and lim^j^oo F'ix) = 1, 
and a sequence a„ such that On — > +oo and On+i — — > as oo. Then 
F is a distribution function. 

Proof. We have to show that F is monotone. Let x < y he two real 
numbers. For an integer k> a^ + y, let ■= max{n : a„ < A; — x} and := 
max{n:a„ < k — y}. The assumptions imply that > and that, as 
k — > oo, Uk, n'l^ oo and < {k — x) — On,, < Onfe+i — (in,, — ^ and, similarly, 
< (fc — y) — a„/_ — > 0. Hence, it follows from (4.4) that, as — >■ oo, 

HYn^ <k)= F{k - a„J + o(l) ^ F(x), 

(4.5) 

P(y,, <k)= F{k - ) + 0(1) ^ F(2/). 

Furthermore, > y„/^, and thus P(l^nfc <k)< f{Yn'^^ < ^) for each k. Con- 
sequently, (4.5) implies F{x) < F{y). Thus F is monotone. □ 

Example 4.7. Let be a pure birth process with ?7o = and P(C/n+i = 
/c + l I [/„ = /c) =2^^. This process has been studied by [8] as approximate 
counting. The same random variables (more precisely, random variables with 
the same distributions) have also been studied as the number of compar- 
isons in an unsuccessful search in a random binary digital search tree with 
n records; see [13, 18] or [25] for definitions. (To see the equivalence, con- 
struct a digital search tree from n random binary strings, with bits i.i.d. and 
Be(l/2), and search (a.s. unsuccessfully) for another random binary string 
w. Say that this requires k comparisons. If the tree is enlarged by a new 
string Wn+i, the search depth of w increases from A; to + 1 if and only if 
the first k bits of w and Wn+i coincide. See also [21].) 

It is shown in [8], Proposition 3 (in an equivalent form) and in [18], Section 
6.4, that (4.4) holds for Un with a„ = log2 n and 

oo T-, 

(4.6) F(x) = 5]-fexp(-2-(--^)), 

j=o ^ 

where Rj := {-ly TVk=i{'^^ - 1)"^ and Q = EfLoRj- Thus F is an infinite 
linear combination of translates of the extreme value distribution function 
of y/ln2 in Example 2.7; note, however, that some Rj are negative, so 
this is not a simple mixture of these distributions. This is a case where 
Lemma 4.6 applies, and thus F is a distribution function. [It seems difficult 
to give an analytic proof that the sum in (4.6) is monotone.] Let X have 
this distribution. 
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Using the identity [16, 18] 

oo oo y \ 

(4.7) E^.-^'=^(^)^=n 1-i ' ^ec, 

it follows from (2.8) that Lp{t) = (5"i?7(e'*)r(l -it/ln2). Moreover, Y has the 
moment generating function '^yif) = r(l — t/ln2) in the region Ret < ln2, 
and it follows that X has, in this domain, the moment generating function 

(4.8) m = E ^e%(l - t/ln2) = ^r(l - t/ln2). 

The function r\ is entire, with zeros at 2*^, fc = 1, 2, . . . , and thus r/(e*) has a 

zero at every pole of r(l — t/ln2), since these are at t = A:ln2, k = 1,2, 

Consequently, the right-hand side of (4.8) is entire, which implies that the 
moment generating function exists for all t and is given by (4.8). [This shows 
that there is significant cancellation in (4.6); the tail 1 — F{x) decreases 
super-exponentially as a: — > oo, while the individual terms in the sum ap- 
proach their limits Rj/Q at the rate 2~^.] The values of ?/'(t) at the poles of 
r(l — t/ In 2) are easily calculated: For k = 1,2, . . . , 

(4.9) ,„ 2) = !?|^2' (- ij) " R«._. r = n (2^ - 1). 

It follows by the results of Example 2.12 and (easily verified) uniform in- 
tegrability that there are oscillatory terms in the asymptotics for E Un — see 
[8], [13], Answer 6.3-28, and [18], Theorem 6.2-— but not for E2*^'^" when k 
is a positive integer. Indeed, it is easy to find exact expressions for these 
exponential moments by recursion (at least for small k); they are poly- 
nomials in n, and the first three are £2^^" = n + 1, E2'^^" = 3("'^^) + 1, 
E23^" = 21 ("+^) + 7n + 1 (cf. [8], Proposition 0). 

Note that the moments of 2^" thus converge to the values in (2.9), al- 
though the distribution does not converge. Thus the method of moment fails 
for these variables. 

Finally, we remark that the problems can be generalized to other bases by 
taking instead p(C/„+i = k + l\Un = k) = a~'^, where a > 1 is an arbitrary 
real for approximate counting (see [8]) and a = m is an integer for m-ary 
digital search trees; see [13]. Similar results hold for the generalization, re- 
placing 2 by a in (4.6), (4.7), (4.8), (4.9) and elsewhere. 

Example 4.8. Let Sn be the number of comparisons in a successful 
search in a random binary digital search tree with n records [13, 16, 18]. 
Then, 

(4.10) Sn = l + Ul^, 
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where (C/n)(f is as in Example 4.7, and I„ is uniform on {0,...,n — 1} 
and independent of (f7„,). Louchard [16] has proved (4.4) with the hmiting 
function 



(4.11) Fix) = 2^(l- 1-y ^ 1 y ^^(1 - e-2-^+'+^). 

Since (4.10) impUes that Sn is (stochasticaUy) monotone, Lemma 4.6 apphes, 
and this function is a distribution function. 

Alternatively, we may use (4.10) and the results of Example 4.7. If we let 
denote the random variable with distribution function given by (4.6), 
and still let an = log2?^, then (4.10) and (4.2) yield, for every sequence fc„ = 
On + Xn of integers, as n—i-oo, 

^ n—l 

FiSn < K) = - V ¥{U^ <kn-l) 
n 

^ n—l 



(4.12) "^^1=0 

n—l 

= - V P(Xu + am-an + l<Xn)+ o(l) 

n ^„ 

= p(Xu + 07„ - an + 1 < x„) + o(l), 
with /„ independent of X^- Since /„,/n^C/~[/(0,l), we have 

a/„ -an = log2(/n/"')-^log2C/ = -^/ln2, 

with Z := - InC/ ~ Exp(l). Hence, (4.12) yields (4.1) with X:=Xu-Z/ln2 + 
1, where Z and X^ are independent. [This X thus must have the distribu- 
tion function (4.11), which also can easily be verified from (4.6).] By (4.8), 
X has the moment generating function 

(4.13) V(t) = ^x(t) g!^^et ^(^*)r(l-Vln2) Ret>-ln2. 

V J H^y ) "^^"^ ^i + i/in2 r/(l)(l + t/ln2) ' 

In analogy with the results above for unsuccessful search, there is oscilla- 
tion in the asymptotics for E5n — see [16], [13], Answer 6.3-28, [18], Theo- 
rem 6.4, and [17], Section 5.2 — but not for the exponential moments E2'''^", 
which are polynomials in n by (4.10) and the corresponding result for C/„. 

Example 4.9. Let, for s > 0, Ys := Y.T=ihk, where the Igk ~ Be(l - 
e~*/^ ) are independent. Thus Yg has the probability generating function 

oo 

D{s,u) :=En^= = \{{e-''^' +u{l-e~'/^')). 

k=l 



P(Xu + o™</c„-l)+o(l) 
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These variables occur in the study of the depth in a random binary Patricia 
trie; see [13] or [25] for definitions. Indeed, if Z)„ is this depth in a Patricia 
trie with n records, then Dn can be obtained by taking a random binary 
string w and comparing it to n — 1 other random binary strings; -D„ equals 
the number of indices k such that there is at least one of the strings that is 
equal to w in the first k — 1 positions but differs in the A;th. If we Poissonize 
and instead compare with Po(s) other strings, we obtain 1^. [The number 
of strings that first differ in the kth position is Po(s/2'^), and these numbers 
are independent.] Note further that depoissonization is easy because the 
variables are monotone: if n± = n it n^/^ and Nn± ~ Po{n±), then P(A'„_ < 
n — 1 < Nn+) 1 and thus ¥(Yn- < Dn < ^n+) 1; hence, the asymptotics 
for Yg below yield the same results for Dn- 

The same variables occur as the number of different values in a sample of 
n values from a geometric Ge(l/2) distribution [2] (this is our Dn+i) and 
in the study of distinct parts in a random decomposition [9]. 

Let Bra{s) := F{Ys = m); thus < < 1 and D{s, u) = E™, ^m(s)w™. 

It is shown in [23] by a manipulation of generating functions that (see [23], 
(35)) 

W{Ys<k)=j2Bn.{s/2'^'-ne'/''^'-'^, 

m=Q 

and thus, for any bounded Xs with log2 s + G Z, 

(4.14) F{Ys-log2S<Xs) = F{xs)+o{l) ass^cx), 
with 

oo 

(4.15) F{x) = J2 5m(2™~^~^)e-2'""'-^ , -oo < x < cx). 

Thus (4.4) holds with a„ = log2 n. 

It is easily seen that each Bm{s) is a continuous function, and thus F{x) is 
continuous (by dominated convergence) with F{x) ^ as x ^ — oo (by dom- 
inated convergence) and -F(x) — > 1 as x — > oo [because F{x) < 1 by (4.14) 
and F{x) > i?o(2~^~^)e~^ = ""]. Lemma 4.6 shows that F is a dis- 
tribution function of some random variable X. 

To see that X has an entire moment generating function, note first that 
for t > 0, 

fc>log2 S fc>log2 S 

<exp((e*-l) <exp(2(e*-l)), 

V fc>log2 S / 
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so by (4.3) and Fatou's lemma, Ee*''''" < oo and thus Ke^^ < oo. The case 
t <0 can be treated similarly, but we will instead perform an exact compu- 
tation. If n > 0, then, by Fubini and (4.15), with x = y + m — 1 and s = 2~^, 

/•oo roo 

V'(-n) :=Ee-"^ =Em / e-''''dx = u e-"^F(x) fix 

Jx J-oo 

°° POO _ 

J2 / e-"^B„,(2™-i-^)e-2'" ^dx 

°° roo _ 

e-"("'-^) / e-''yBm{2-y)e-^ " dy 

™_n J — OO 



U 

m=0 
oo 

U 

m=0 
roo 

-CXD 
o« /-OO 



lie 
hi^ 



/•oo 

/ s'''^'''^-^D{s,e-'')e-' ds< oo, 
Jo 



since < D{s,e ") < 1. Hence, ip is an entire function with, by analytic 
continuation, 

fp—t roo 

(4.16) i^{t) = —, / D{s,e')s-''^'^'^-^e-' ds, Ret<0. 

In 2 Jo 

Using integration by parts, we can obtain similar formulas (involving also 
dD/ds and possibly higher derivatives) that extend into the right half-plane; 
in particular, this yields formulas for the characteristic function ip{t) = ipijt) 
that can be used to find the moments of X and the other constants in 
the formulas in Theorem 2.3. However, we shall leave these formulas to the 
reader and use a slightly different approach, where we use (4.16) for —e + it 
and let e \ to find il^ijt) and its derivatives (at least for the t that we 
need). Let 

oo 

(4.17) m(s) :=Ey, = ^(1-6-^/2*=)^ ^ > 

k=l 

and note that < m(s) < s. Further, since 1^ is a sum of independent 
Bernoulli variables, Var Yg < Ey^i and thus Eys(ys — 1) = Var + m(s)^ — 
m{s) < m[s)'^ < s^. Consequently, by a standard Taylor expansion of the 
probability generating function D{s,u), 

D{s,u) = 1 + m{s){u - 1) + 0{s^\u - Ip), |n| < 1. 



Evaluating (4.16) at t = —e + 27rni with n (zZ and e > thus yields 



fpS roo 

^it) = -P- / (1 + - l)m(s) + 0(sV))s-*/i'^2-i^ 
mi Jo 



'ds, 
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(4.18) = -j^(^e'T{-t/ ln2) - £ m{s)s~'/^''^~^e~' ds^ + 0{e^' 

= e'T{l - t/ln2) + e:^g{-t/ln2) + 0{e^), 
where, for Re^; > 0, using (4.17), 

POO 

g{z) := / 7n(s)s^"-^e"^ds 



^ poo , 
oo 

^(i_(i + 2-^)-)r(z) 



k=l 

CO oo 



(4.19) =-r(^)EE /)2-^' 

k=lj=l ^ 



i!(2^-l)' 

we define g{z) for z 7^ — 1, —2, ... by the latter sum. Letting e ^ in (4.18) 
and recalhng t = —e + 27rni, we find 

(4.20) ip{2'Kn) = V^(27rni) = r(l - 27rm/ In 2) 
as in several other examples, and (first taking d/de) 

, , , , „ / 27m \ 1 / 27rn \ 27rni / 27rn \ 

(4.21) *'(2™) = -r(i - _■) - _r'(i - _i) - ^,[-^^). 

In particular, EX = V^'(O) = 7/ln2 - 1 -0.16725382272. We similarly find 

^2 .n^, r"(i) ^ 7 , ^ 25(0) 



EX^ = (0) = - 2-^ + 1 



ln2 2 ln2 ln2 ' 

where 

^(0) = E (J n = E 11^(1 + 2"') « 0.86887665, 

and thus 

(4.22) Var X = ^ w 0.916666666667904. 

^ ' 61n2 2 ln2 

As remarked in Example 2.11, VarX + is extremely close to 1. 
It follows from Theorem 2.1 and Remark 4.2 that 

E - log2 s = E Xiog^ , + 0(1) = 7/ In 2 - 1/2 + /3i (logs s) + o{l) 
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(cf. [13], Answers 6.3-31 and 5.2.2-48, [9, 10]; recall also (4.17)). 
For the variance, we find directly from the definition 

oo 

Vary, = £(l-e-^/2^)e-^/2' 

oo 
k=l 

since the sum telescopes. Thus, as s — > oo, Var 1. Hence, by Remark 4.2 
(uniform square integrability is easily verified), VarXa = 1 for every a; see 
[9]. As discussed in Example 2.11, there is thus no oscillation in the variance. 
For higher moments, see [9]. 

This example can also be generalized to bases other than 2 [2, 23]. Note, 
however, that the cancellation of the oscillations in the variance of is 
special for base 2 [2] [and for 2^/'^, k > 1, which does not make sense for 
Patricia tries but occurs for sampling from a geometric distribution Ge{p) 
with q = l-p=l/2^/'']. 

5. Final remarks. It is easily checked that if X has the distribution func- 
tion F{x) := P(X < x), then X^ has the distribution function 

(5.1) r{Xa<x) = F{[x + a\-a). 

Now suppose that (4.4) holds with some continuous function F, not neces- 
sarily a distribution function, and consider a subsequence such that {a„} — > 
a G [0,1]. Then, 

HYn -an<x) = F{Yn < X + ttn) 

= F{Yn< [x + an\) 

= F([x + a„J -a„) + o(l) 

= F{lx + {an}\-{an})+o{l) 

^F{[x + a\ - a) 

when x + a ^Tj. Hence y„ — a„ — ^ Xa, as in (4.3), where Xa has the distri- 
bution given by (5.1). In particular, the right-hand side of (5.1) is a distri- 
bution function, even if F is not (provided a is a limit point of {a„}); this 
(if valid for all a) is just equivalent to F{x) < F{x + 1) for all x. 

Note further that if F has bounded variation, we can define ip{t) := 
/ e^^^dF(x) and moments / x'^dF{x), and Theorems 2.1 and 2.3 still hold, 
with natural interpretations, by the same proofs. Thus, for example, results 
about asymptotics for moments of Yn can be obtained without knowing 
whether F is a distribution function or not. (This is done, with different 
methods, in [17, 18, 25].) 
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Nevertheless, in each example where (4.4) holds for some we consider 
it to be an interesting question whether or not the limit F is a distribution 
function. We have given several examples above where the answer is affirma- 
tive. One example where the problem is open is provided by leader election 
in the biased case (p^ 1/2) [11]. Further, there are several examples where 
it seems that this question has not yet been studied; the interested reader 
can start by investigating the remaining examples in [17]. 

Acknowledgments. I would like to thank Dag Jonsson, Guy Louchard 
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